A new method for detecting human recombination hotspots and its applications to the HapMap ENCODE data.

نویسندگان

  • Jun Li
  • Michael Q Zhang
  • Xuegong Zhang
چکیده

Computational detection of recombination hotspots from population polymorphism data is important both for understanding the nature of recombination and for applications such as association studies. We propose a new method for this task based on a multiple-hotspot model and an (approximate) log-likelihood ratio test. A truncated, weighted pairwise log-likelihood is introduced and applied to the calculation of the log-likelihood ratio, and a forward-selection procedure is adopted to search for the optimal hotspot predictions. The method shows a relatively high power with a low false-positive rate in detecting multiple hotspots in simulation data and has a performance comparable to the best results of leading computational methods in experimental data for which recombination hotspots have been characterized by sperm-typing experiments. The method can be applied to both phased and unphased data directly, with a very fast computational speed. We applied the method to the 10 500-kb regions of the HapMap ENCODE data and found 172 hotspots among the three populations, with average hotspot width of 2.4 kb. By comparisons with the simulation data, we found some evidence that hotspots are not all identical across populations. The correlations between detected hotspots and several genomic characteristics were examined. In particular, we observed that DNaseI-hypersensitive sites are enriched in hotspots, suggesting the existence of human beta hotspots similar to those found in yeast.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SequenceLDhot: detecting recombination hotspots

MOTIVATION There is much local variation in recombination rates across the human genome--with the majority of recombination occurring in recombination hotspots--short regions of around approximately 2 kb in length that have much higher recombination rates than neighbouring regions. Knowledge of this local variation is important, e.g. in the design and analysis of association studies for disease...

متن کامل

Algorithms for Inferring Recombination and Association Mapping in Populations

A current high priority research goal is to understand how genetic variations influence complex genetic diseases (or more generally traits). Recombination is an important biological and genetic process that plays a major role in the logic behind association mapping, a currently intensely studied method widely hoped to efficiently find genes (alleles) associated with complex diseases. Recently, ...

متن کامل

Disentangling linkage disequilibrium and linkage from dense single-nucleotide polymorphism trio data.

Parent-offspring trios are widely collected for disease gene-mapping studies and are being extensively genotyped as part of the International HapMap Project. With dense maps of markers on trios, the effects of LD and linkage can be separated, allowing estimation of recombination rates in a model-free setting. Here we define a model-free multipoint method on the basis of dense sequence polymorph...

متن کامل

A novel method with improved power to detect recombination hotspots from polymorphism data reveals multiple hotspots in human genes.

We introduce a new method for detection of recombination hotspots from population genetic data. This method is based on (a) defining an (approximate) penalized likelihood for how recombination rate varies with physical position and (b) maximizing this penalized likelihood over possible sets of recombination hotspots. Simulation results suggest that this is a more powerful method for detection o...

متن کامل

LDsplit: Screening cis-regulatory motifs stimulating meiotic recombination hotspots by analysis of DNA sequence polymorphisms

In this newer version, LDsplit was implemented in Java language with a user-friendly interface as shown in Figure 2. This greatly facilitates the analysis of data, providing users with an integrative view of genomic context of hotspots (e.g. flanking DNA sequences). We tested the newly implemented LDsplit on HapMap SNP data to predict the association of the FG11 SNP with the DNA2 hotspot, previ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • American journal of human genetics

دوره 79 4  شماره 

صفحات  -

تاریخ انتشار 2006